Comparing and Detecting Stationarity and Dataset Shift
نویسندگان
چکیده
Abstract Machine learning algorithms have been increasingly applied to spatial numerical modeling. However, it is important understand when such methods will underperform. are impacted by dataset shift ; modeling domains of interest present non-stationarities there no guarantee that the trained models effective in unsampled areas. This work aims compare stationarity requirement geostatistical concept shift. Also, workflow developed detect data prior modeling, this involves applying a discriminative classifier and two sample Kolmogorv-Smirnov test model And, required lazy modification support vector regression proposed account for The benefits algorithm demonstrated on well-known non-stationary Walker Lake improves root mean squared error up 25% relative standard SVR approach, areas where present.
منابع مشابه
Dataset Shift Adaptive Computation and Machine Learning
All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from this is the last candidate. next esc will revert to uncompleted text. he publisher. Overview Dataset shift is a challenging situation where the joint distribution of inputs and o...
متن کاملLearning under Non-Stationarity: Covariate Shift and Class-Balance Change
One of the fundamental assumptions behind many supervised machine learning algorithms is that training and test data follow the same probability distribution. However, this important assumption is often violated in practice, for example, because of an unavoidable sample selection bias or non-stationarity of the environment. Due to violation of the assumption, standard machine learning methods s...
متن کاملLearning under Non-stationarity: Covariate Shift Adaptation by Importance Weighting
The goal of supervised learning is to estimate an underlying input-output function from its input-output training samples so that output values for unseen test input points can be predicted. A common assumption in supervised learning is that the training input points follow the same probability distribution as the test input points. However, this assumption is not satisfied, for example, when o...
متن کاملDataset shift in PLDA based speaker verification
Dataset shift is a problem widely studied in the field of speaker recognition. Among the different types of dataset shift, covariate shift is the most common one in real scenarios. Traditional solutions for the problem of covariate shift have been developed in the context of channel and session variability, and make use of large datasets to train models for channel/session compensation. However...
متن کاملComparing the mental health of housewives and day shift working women in Yazd County, Iran, 2010
Background: There are studies reporting the adverse effects of working outside the house on women’s mental health. As the mental health is affected by the structure and culture of each society, this study was conducted to compare the mental health of housewives and working woman in Yazd county, Iran (2010). Materials and Methods: This cross-sectional study was done in Yaz...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Springer proceedings in earth and environmental sciences
سال: 2023
ISSN: ['2524-342X', '2524-3438']
DOI: https://doi.org/10.1007/978-3-031-19845-8_3